Models

	Provider		Types
	OpenAI	GPT 4o mini	Text	$0.15	$0.60	128K	Cheapest
	OpenAI	GPT 4o mini (2024 07 18)	Text	$0.15	$0.60	128K	Cheapest
	OpenAI	GPT 4o	Text	$2.50	$10.00	128K	Mid
	OpenAI	GPT 4o (2024 08 06)	Text	$2.50	$10.00	128K	Mid
	OpenAI	ChatGPT 4o	Text	$5.00	$15.00	128K	Expensive
	OpenAI	GPT 4o (2024 11 20)	Text	$2.50	$10.00	128K	Mid
	OpenAI	GPT 4.1	Text	$2.00	$8.00	1M	Mid
	OpenAI	GPT 4.1 Mini	Text	$0.40	$1.60	1M	Cheapest
	OpenAI	GPT 4.1 Nano	Text	$0.10	$0.40	1M	Cheapest
	OpenAI	GPT 3.5 Turbo	Text	$0.50	$1.50	16.4K	Cheapest
	OpenAI	o1	Text	$15.00	$60.00	200K	Expensive
	OpenAI	o3	Text	$10.00	$40.00	200K	Expensive
	OpenAI	o3 Mini	Text	$1.10	$4.40	200K	Cheapest
	OpenAI	o4 Mini	Text	$1.10	$4.40	200K	Cheapest
	OpenAI	Codex Mini	Text	$1.50	$6.00	200K	Mid
	OpenAI	GPT 4 Turbo	Text	$10.00	$30.00	128K	Expensive
	OpenAI	GPT 4o Search Preview	Text	$2.50	$10.00	128K	Mid
	OpenAI	GPT 4.5 (Preview)	Text	$75.00	$150.00	128K	Expensive
	OpenAI	GPT 3.5 Turbo 16k	Text	$0.50	$1.50	16.4K	Cheapest
	OpenAI	o1 mini	Text	$1.10	$4.40	128K	Cheapest

GPT 4o mini

(gpt-4o-mini)

Cheapest

Text

GPT 4o miniBy OpenAI

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than GPT-3.5 Turbo. It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferences common leaderboards. Check out the launch announcement to learn more.

$0.15Input(Per Million)

$0.60Output(Per Million)

128KContext Window

GPT 4o mini (2024 07 18)

(gpt-4o-mini-2024-07-18)

Cheapest

Text

GPT 4o mini (2024 07 18)By OpenAI

GPT-4o mini is OpenAI's newest model after GPT-4 Omni, supporting both text and image inputs with text outputs. As their most advanced small model, it is many multiples more affordable than other recent frontier models, and more than 60% cheaper than GPT-3.5 Turbo. It maintains SOTA intelligence, while being significantly more cost-effective. GPT-4o mini achieves an 82% score on MMLU and presently ranks higher than GPT-4 on chat preferences common leaderboards. Check out the launch announcement to learn more.

$0.15Input(Per Million)

$0.60Output(Per Million)

128KContext Window

GPT 4o

(gpt-4o)

Mid

Text

GPT 4oBy OpenAI

GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"

$2.50Input(Per Million)

$10.00Output(Per Million)

128KContext Window

GPT 4o (2024 08 06)

(gpt-4o-2024-08-06)

Mid

Text

GPT 4o (2024 08 06)By OpenAI

The 2024-08-06 version of GPT-4o offers improved performance in structured outputs, with the ability to supply a JSON schema in the respone_format. Read more here: https://openai.com/index/introducing-structured-outputs-in-the-api/. GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities. For benchmarking against other models, it was briefly called "im-also-a-good-gpt2-chatbot"

$2.50Input(Per Million)

$10.00Output(Per Million)

128KContext Window

ChatGPT 4o

(chatgpt-4o-latest)

Expensive

Text

ChatGPT 4oBy OpenAI

OpenAI ChatGPT 4o is continually updated by OpenAI to point to the current version of GPT-4o used by ChatGPT. It therefore differs slightly from the API version of GPT-4o in that it has additional RLHF. It is intended for research and evaluation. OpenAI notes that this model is not suited for production use-cases as it may be removed or redirected to another model in the future.

$5.00Input(Per Million)

$15.00Output(Per Million)

128KContext Window

GPT 4o (2024 11 20)

(gpt-4o-2024-11-20)

Mid

Text

GPT 4o (2024 11 20)By OpenAI

The 2024-11-20 version of GPT-4o offers a leveled-up creative writing ability with more natural, engaging, and tailored writing to improve relevance & readability. It's also better at working with uploaded files, providing deeper insights & more thorough responses. GPT-4o ("o" for "omni") is OpenAI's latest AI model, supporting both text and image inputs with text outputs. It maintains the intelligence level of GPT-4 Turbo while being twice as fast and 50% more cost-effective. GPT-4o also offers improved performance in processing non-English languages and enhanced visual capabilities.

$2.50Input(Per Million)

$10.00Output(Per Million)

128KContext Window

GPT 4.1

(gpt-4.1)

Mid

Text

GPT 4.1By OpenAI

GPT-4.1 is a flagship large language model optimized for advanced instruction following, real-world software engineering, and long-context reasoning. It supports a 1 million token context window and outperforms GPT-4o and GPT-4.5 across coding (54.6% SWE-bench Verified), instruction compliance (87.4% IFEval), and multimodal understanding benchmarks. It is tuned for precise code diffs, agent reliability, and high recall in large document contexts, making it ideal for agents, IDE tooling, and enterprise knowledge retrieval.

$2.00Input(Per Million)

$8.00Output(Per Million)

1MContext Window

GPT 4.1 Mini

(gpt-4.1-mini)

Cheapest

Text

GPT 4.1 MiniBy OpenAI

GPT-4.1 Mini is a mid-sized model delivering performance competitive with GPT-4o at substantially lower latency and cost. It retains a 1 million token context window and scores 45.1% on hard instruction evals, 35.8% on MultiChallenge, and 84.1% on IFEval. Mini also shows strong coding ability (e.g., 31.6% on Aider's polyglot diff benchmark) and vision understanding, making it suitable for interactive applications with tight performance constraints.

$0.40Input(Per Million)

$1.60Output(Per Million)

1MContext Window

GPT 4.1 Nano

(gpt-4.1-nano)

Cheapest

Text

GPT 4.1 NanoBy OpenAI

For tasks that demand low latency, GPT‑4.1 nano is the fastest and cheapest model in the GPT-4.1 series. It delivers exceptional performance at a small size with its 1 million token context window, and scores 80.1% on MMLU, 50.3% on GPQA, and 9.8% on Aider polyglot coding – even higher than GPT‑4o mini. It's ideal for tasks like classification or autocompletion.

$0.10Input(Per Million)

$0.40Output(Per Million)

1MContext Window

GPT 3.5 Turbo

(gpt-3.5-turbo)

Cheapest

Text

GPT 3.5 TurboBy OpenAI

GPT-3.5 Turbo is OpenAI's fastest model. It can understand and generate natural language or code, and is optimized for chat and traditional completion tasks. Training data up to Sep 2021.

$0.50Input(Per Million)

$1.50Output(Per Million)

16.4KContext Window

o1

(o1)

Expensive

Text

o1By OpenAI

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 model series is trained with large-scale reinforcement learning to reason using chain of thought. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology.

$15.00Input(Per Million)

$60.00Output(Per Million)

200KContext Window

o3

(o3)

Expensive

Text

o3By OpenAI

o3 is a well-rounded and powerful model across domains. It sets a new standard for math, science, coding, and visual reasoning tasks. It also excels at technical writing and instruction-following. Use it to think through multi-step problems that involve analysis across text, code, and images. Note that BYOK is required for this model.

$10.00Input(Per Million)

$40.00Output(Per Million)

200KContext Window

o3 Mini

(o3-mini)

Cheapest

Text

o3 MiniBy OpenAI

OpenAI o3-mini is a cost-efficient language model optimized for STEM reasoning tasks, particularly excelling in science, mathematics, and coding. This model supports the `reasoning_effort` parameter, which can be set to "high", "medium", or "low" to control the thinking time of the model. The default is "medium". OpenRouter also offers the model slug `openai/o3-mini-high` to default the parameter to "high". The model features three adjustable reasoning effort levels and supports key developer capabilities including function calling, structured outputs, and streaming, though it does not include vision processing capabilities. The model demonstrates significant improvements over its predecessor, with expert testers preferring its responses 56% of the time and noting a 39% reduction in major errors on complex questions. With medium reasoning effort settings, o3-mini matches the performance of the larger o1 model on challenging reasoning evaluations like AIME and GPQA, while maintaining lower latency and cost.

$1.10Input(Per Million)

$4.40Output(Per Million)

200KContext Window

o4 Mini

(o4-mini)

Cheapest

Text

o4 MiniBy OpenAI

OpenAI o4-mini is a compact reasoning model in the o-series, optimized for fast, cost-efficient performance while retaining strong multimodal and agentic capabilities. It supports tool use and demonstrates competitive reasoning and coding performance across benchmarks like AIME (99.5% with Python) and SWE-bench, outperforming its predecessor o3-mini and even approaching o3 in some domains. Despite its smaller size, o4-mini exhibits high accuracy in STEM tasks, visual problem solving (e.g., MathVista, MMMU), and code editing. It is especially well-suited for high-throughput scenarios where latency or cost is critical. Thanks to its efficient architecture and refined reinforcement learning training, o4-mini can chain tools, generate structured outputs, and solve multi-step tasks with minimal delay—often in under a minute.

$1.10Input(Per Million)

$4.40Output(Per Million)

200KContext Window

Codex Mini

(codex-mini-latest)

Mid

Text

Codex MiniBy OpenAI

codex-mini-latest is a fine-tuned version of o4-mini specifically for use in Codex CLI. For direct use in the API, we recommend starting with gpt-4.1.

$1.50Input(Per Million)

$6.00Output(Per Million)

200KContext Window

GPT 4 Turbo

(gpt-4-turbo)

Expensive

Text

GPT 4 TurboBy OpenAI

The latest GPT-4 Turbo model with vision capabilities. Vision requests can now use JSON mode and function calling. Training data: up to December 2023.

$10.00Input(Per Million)

$30.00Output(Per Million)

128KContext Window

GPT 4o Search Preview

(gpt-4o-search-preview)

Mid

Text

GPT 4o Search PreviewBy OpenAI

GPT-4o Search Preview is a specialized model for web search in Chat Completions. It is trained to understand and execute web search queries.

$2.50Input(Per Million)

$10.00Output(Per Million)

128KContext Window

GPT 4.5 (Preview)

(gpt-4.5-preview)

Expensive

Text

GPT 4.5 (Preview)By OpenAI

GPT-4.5 (Preview) is a research preview of OpenAI's latest language model, designed to advance capabilities in reasoning, creativity, and multi-turn conversation. It builds on previous iterations with improvements in world knowledge, contextual coherence, and the ability to follow user intent more effectively. The model demonstrates enhanced performance in tasks that require open-ended thinking, problem-solving, and communication. Early testing suggests it is better at generating nuanced responses, maintaining long-context coherence, and reducing hallucinations compared to earlier versions. This research preview is intended to help evaluate GPT-4.5's strengths and limitations in real-world use cases as OpenAI continues to refine and develop future models.

$75.00Input(Per Million)

$150.00Output(Per Million)

128KContext Window

GPT 3.5 Turbo 16k

(gpt-3.5-turbo-0125)

Cheapest

Text

GPT 3.5 Turbo 16kBy OpenAI

The latest GPT-3.5 Turbo model with improved instruction following, JSON mode, reproducible outputs, parallel function calling, and more. Training data: up to Sep 2021. This version has a higher accuracy at responding in requested formats and a fix for a bug which caused a text encoding issue for non-English language function calls.

$0.50Input(Per Million)

$1.50Output(Per Million)

16.4KContext Window

o1 mini

(o1-mini)

Cheapest

Text

o1 miniBy OpenAI

The latest and strongest model family from OpenAI, o1 is designed to spend more time thinking before responding. The o1 models are optimized for math, science, programming, and other STEM-related tasks. They consistently exhibit PhD-level accuracy on benchmarks in physics, chemistry, and biology. Learn more in the launch announcement. Note: This model is currently experimental and not suitable for production use-cases, and may be heavily rate-limited.

$1.10Input(Per Million)

$4.40Output(Per Million)

128KContext Window

...

Showing 1-20 of 265 models